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PACKAGE RESOLUTION MECHANISM FOR DATABASE SYSTEMS 



FIELD OF THE INVENTION 

The present invention relates generally to database systems and more particularly 
to a package resolution mechanism in a database system. 

BACKGROUND OF THE INVENTION 

Figure 1 is a block diagram of a database system 10. The system includes an 
application 12 which communicates with a database server 14. The application 12 
includes host language and SQL statements which are utilized to communicate v^th the 
server to obtain the appropriate information from the tables therein. The database server 
14 typically prepares or converts the host language to allow the application to be 
executed (by the server). This preparation or process is referred to as preparing a 
package. A package is typically identified miiquely by an application program identifier 
(application ID) and a collection identifier (collection ID). 

The server includes a memory 20 and optimizer 22. The optimizer 22 provides 
the best access path for the information from tables of the user database for a particular 
package from the plurality of packages 26 in the DB2 catalog 18. The best access path is 
provided via a bind process. The server 14 is in tum in communication with the DB2 
catalog 16 and a user database 18. 

The ability to switch between various packages and provide package resolution on 
a database server is a useful fimction for SQL applications. For example, it may be 
desirable to create multiple collection IDs containing the same package name in order to 
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allow different bind options or SQL access paths to be in effect. Certain database 
management systems such as DB2 Universal Database for z/OS by IBM Corporation 
have offered solutions that involve a list of possible package collections that the SQL 
applications are bound into. 

The database server would use this list to locate a matching package, where the 
list of packages is specified during static bind of the application plan and thus not 
changeable during execution time. Further, execution environments where remote clients 
are connected do not have the ability to run with application plans, thus the ability to 
switch between packages is very limited. 

One solution is for the application to specify to the database server its package 
collection via a SQL statement, referred to as SET CURRENT PACKAGESET. The 
drawback with this solution is that multiple programs within the application need to 
coordinate the setting of the CURRENT PACKAGESET value, and it is difficult to keep 
the correct value in the CURRENT PACKAGESET register of the application at all times 
when the application is complex (e.g. lots of separately compiled programs that make up 
a large application). 

Given a set of packages such as JDBC_COLLECTION, SQLJ_C0LL1 and 
SQLJ__C0LL2 when utilizing the SET CURRENT PACKAGESET statement, a 
programmer would have to code application logic in the following manner when utilizing 
the SET CURRENT PACKAGESET statement: 

SET CURRENT PACKAGESET = "JDBC_COLLECTION" 
perform some JDBC logic 

SET CURRENT PACKAGESET = "SQLJ_C0LL1" 
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run SQLJ logic 

SET CURRENT PACKAGESET = "JDBC_COLLECTION" 
perform more JDBC logic 

SET CURRENT PACKAGESET = "SQLJ_C0LL2" 
Run SQLJ logic 

Accordingly, the application developer is only able to specify a single package 
collection name at any given point in time via the CURRENT PACKAGESET. As the 
application moved from subroutine to subroutine, the application developer had to be 
very careful to set and restore the CURRENT PACKAGESET value properly so that the 
current package collection was in effect at all times. 

With the conventional database system, a plan's package list was managed by the 
client. If the client's package collection list contained "A, B, C, D", the client would flow 
the SQL statement across the network once for each package collection until the correct 
package containing the SQL statement is found. Thus in the worst case, the client would 
send the SQL statement to the server four times before finding a matching package. 

Another problem with the list of package collections associated with each plan is 
observed in the stored procedure and user-defined function execution environment. 
Stored procedures and user-defined functions are often stand-alone sets of programs and 
invoked by multiple callers. However, users cannot identify package resolution schemes 
independently from the rules established by the plan, because the plan's list is always 
used to search packages as the stored procedures are executed. A solution for package 
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management is needed that will allow nested storage procedures or user-defined 
functions to be implemented without concern for the caller's runtime envirormient. 

Accordingly, what is needed is a system and method for package resolution in a 
database system that overcomes the above-identified problems. The present invention 
addresses such a need. 

SUMMARY OF THE INVENTION 

A method and system for providing package resolution in a database system is 
disclosed. The method and system comprise providing a statement within a server which 
allows an application to identify a list of package collections. The method and system 
further includes executing the statement to obtain the appropriate package. 

A system and method in accordance with the present invention provides a new 
current package path value, which is set via a statement referred to as SET CURRENT 
PACKAGE PATH. The SET CURRENT PACKAGE PATH statement allows the 
application to supply a precedence list of qualified package collections during execution 
time, therefore the scheme for resolving packages can be changed at any point during 
execution. The SET CURRENT PACKAGE PATH statement improves application 
development by making the package resolution mechanism less error prone and less 
tedious for the application developer, while providing an optimal-performance solution in 
the client-server environment. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram of a database system. 
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Figure 2 is a flow chart for a package resolution system in accordance with the 
present invention. 

Figure 3 illustrates a distributed environment where there is a plurality of database 
systems in accordance with the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates generally to database systems and more particularly 
to a package resolution mechanism in a database system. The following description is 
presented to enable one of ordinary skill in the art to make and use the invention and is 
provided in the context of a patent application and its requirements. Various modifications 
to the preferred embodiment and the generic principles and features described herein will be 
readily apparent to those skilled in the art. Thus, the present invention is not intended to be 
limited to the embodiment shown but is to be accorded the widest scope consistent with the 
principles and features described herein. 

A system and method in accordance with the present invention provides a new 
current package path value in the server, which is set via a statement referred to as Set 
Current Package Path. In a preferred embodiment this statement is an SQL statement. 
One of ordinary skill in the art readily recognizes however that the statement could be in 
a variety of languages and its use would still be within the spirit and scope of the present 
invention. This SQL statement allows the application to identify a precedence list of 
package collections that is used during package resolution. The CURRENT PACKAGE 
PATH value contains a list of collections for packages. The value is maintained and 
updated through the SET CURRENT PACKAGE PATH statement. 
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Figure 2 is a flow chart for a package resolution system in a database system in 
accordance with the present invention. First, a list of a collection of packages are provided 
in a predetermined format is provided, via step 102. Next, a statement is provided in a 
server that allows an application to identify the list, via step 104. Finally, the statement can 
be executed to obtain the appropriate package, via step 106. 

The list of package collections can include literals, host variables, keywords, and 
null string. At the thread level, the list of package collections is built in an array format 
after syntax checking, duplicate values eliminating, and keyword resolution. During 
subsequent SQL statement execution, the database server will use this list to locate the 
package that contains the bound SQL statement in the order of the specified entries. 
Accordingly, the server locates the package rather than the client. For instance, if the 
CURRENT PACKAGE PATH value contains "A, USER, C", the resolved collections are 
"A", "ADMFOOl", "C" where ADMFOOl is the current primary user ID. The search for 
an outstanding package will be performed with using collection A first. If such package 
is not found, a locate will be done with collection ADMFOOl, and so on. 

The package loading process often includes an authorization check, a catalog 
lookup, and a potential storage reading that is very expensive. As packages are located, 
they are cached in memory at each thread level as "found" package entries. The found 
packages can be reused on subsequent SQL statements, or as application logic switches to 
a different package and then back. As searching through the list, the thread also keeps 
track of the packages that were not found, to prevent having to repeatedly look up a non- 
existing package. This keeping track can be implemented by creating "not-found" entries 
in the table of the user database. The not-found cached entries help to minimize look up 
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of entries in the database catalog for a bound package. When searching through the table 
the thread would recognize the not-found entries, skip the loading process for such 
collection, and move on to the next collection. All found and not-found entries in the 
cache have an identifier that can include for example, local server name, collection, 
program name, and timestamp when the program was precompiled. 

As before-mentioned with the conventional SET CURRENT PACKAGESET 
statement the client had to control the list of packages. In addition only a single 
collection package could be specified at any point in time. Once again to repeat the 
example given before given a set of packages such as JDBC COLLECTION, 
SQLJ_C0LL1 and SQLJ_C0LL2 when utilizing the SET CURRENT PACKAGESET 
statement, a programmer would have to code application logic in the following manner 
when utilizing the SET CURRENT PACKAGESET statement: 

SET CURRENT PACKAGESET = "JDBC_COLLECTION" 
perform some JDBC logic 

SET CURRENT PACKAGESET = "SQLJ_COLLr' 
run SQLJ logic 

SET CURRENT PACKAGESET = "JDBC__COLLECTION" 
perform more JDBC logic 

SET CURRENT PACKAGESET = "SQLJ_C0LL2" 

With the CURRENT PACKAGE PATH statement in accordance with the present 
invention, the application developer's job is greatly simplified. To illustrate this 
simplification refer now to the coded application logic below. 
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First the statement that identifies all of the collections in one statement appears 

below: 

SET CURRENT PACKAGE PATH - "SQLJ_C0LL1, SQLJ_^C0LL2, 

^ JDBC_COLLECTION" 

Then the JDBC and SQLJ logic only has to be performed one time as shown below: 

perform some JDBC logic 

run SQLJ logic 

perform more JDBC logic 

Newly Bound Packages 

At certain points, a newly bound package may come to existence via the bind 
process, and the application may desire to use it (especially if its collection is at the fi-ont 
of the CURRENT PACKAGE PATH). The not-found entries for packages would have 
to be invalidated in the cache table in order for the subsequent search to locate and load 
in the newly bound package. For example, when the transaction is committed or when 
the thread is reused for a new connection, the package cache table's not-foimd entries can 
be cleared out. An example of this type of transaction is illustrated below. 

First; Program 1 issues: 

SET CURRENT PACKAGE PATH = A, B, C 

SELECT FROM Table 1 in user database package B. program 1 is loaded and 

cached. 
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-> A dummy not-found package 
for A.programl is also cached. 



Then, Program 2 is called which issues: 

SELECT FROM Table 2 in the user database -> package A.program2 is loaded and 
cached. 

Then return to Program 1 



Since there may be other factors affecting package resolution, such as a 
CURRENT PACKAGESET value, a precedence rule must be established. The proposed 
mechanism assumes the CURRENT PACKAGE PATH as highest in the precedence rule, 
followed by the CURRENT PACKAGESET value if turned on, followed by the static 
package list bound into the plan. 

Distributed Environment 

Figure 3 illustrates a distributed environment 100 where there is a plurality of 
database server systems 202 through 206 in accordance with the present invention. In a 
distributed environment, the CURRENT PACKAGE PATH value is location sensitive, 
that is, the order of its setting (via the SET CURRENT PACKAGE PATH SQL 
statement) and when a remote site connection is established (via the CONNECT SQL 
statement) matters. It is the responsibility of the client to propagate this value to the 
server in the event that the value is set, so that all subsequent SQL operations will utilize 
the new value. In terms of implementation in some database systems, the value flows 



INSERT INTO Tablel 



^ package B. program 1 is found in the cache and used 



COMMIT 
UPDATE Table 1 



-> cache table is cleaned up 

-> package searching is done for A.programl and may load 
if it exists 
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using a DRDA set environment command (EXCSQLSET), which is outside of the 
standard value received on the EXCSQLSET command, and this value is utilized as the 
list of package collections to search on all subsequent remote operations. 

To illustrate this in the context of an example refer now to the below description 
in conjunction with Figure 3. 

First, CONNECT TO SERVER 202, assuming server 202 is the local server. 
Then SET CURRENT PACKAGE PATH = A, B, C. Next, the server 202 uses 
CURRENT PACKAGE PATH(A,B,C) value to locate package. Next, CONNECTS TO 
server 204, SET CURRENT PACKAGE PATH = D, E, F. Then, thereafter CURRENT 
PACKAGE PATH(D, E, F) is provided to server 204 and used to locate the package 
there. Finally, CONNECT TO server 206. Server 202 sends collections A, B, and C one 
at a time to server 206 and they are used to search package. 

Accordingly, the SET CURRENT PACKAGE PATH statement provides 
improved performance for remote clients. With the CURRENT PACKAGE PATH value, 
the package precedence list is sent to the server and the server manages the package 
resolution search. Accordingly each statement only has to flow to the server once, which 
has the potential to greatly reduce network traffic in some applications. 

Advantages 

There are several key advantages to the SET CURRENT PACKAGE PATH 
statement in accordance with the present invention: 

1) The SET CURRENT PACKAGE PATH statement allows the application 
to supply a precedence list of qualified package collections during execution time, 
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therefore the scheme for resolving packages can be changed at any point during 
execution (rather than being statically established at program preparation time as in the 
conventional system). The application program has a lot more flexibility in resolving 
package names, since the package resolution scheme can change from one part of the 
application to another. 

2) The SET CURRENT PACKAGE PATH statement improves application 
development by making the package resolution mechanism less error prone and less 
tedious for the application developer. With the SET CURRENT PACKAGE PATH 
mechanism, the application developer can establish a search precedence list for package 
collections, and in many cases it is only necessary to establish this precedence list once or 
tvsdce during life of the application. This greatly reduces the complexity of managing 
package resolution for complex applications programs. Java development in SQLJ is an 
example where this advantage is particularly apparent. In Java programs, it is common to 
use both JDBC and SQLJ in a single application program. The JDBC package is 
typically in one package collection, and the SQLJ packages are typically in separate 
package collections. 

3) The SET CURRENT PACKAGE PATH statement also provides 
improved performance for remote clients. With the CURRENT PACKAGE PATH, the 
package precedence list is sent to the server and the server manages the package 
resolution search. Accordingly each statement only has to flow to the server once, which 
has the potential to greatly reduce network traffic in some applications. 

4) The SET CURRENT PACKAGE PATH statement is also able to support 
nested stored procedures and UDFs, so that package resolution for these programs can 
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happen totally independently from the package resolution used by the calling program. 
This makes it much easier for the application developer to write stored procedures or 
UDFs without concern for the caller's package resolution scheme. 

Although the present invention has been described in accordance with the 
6 embodiments shown, one of ordinary skill in the art will readily recognize that there 

could be variations to the embodiments and those variations would be within the spirit 
and scope of the present invention. Accordingly, many modifications may be made by 
one of ordinary skill in the art without departing from the spirit and scope of the 
appended claims. 
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